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Abstract 

In this paper we consider the model of communication where wireless devices can either switch their 
radios off to save energy (and hence, can neither send nor receive messages), or switch their radios on 
and engage in communication. The problem has been extensively studied in practice, in the setting such 
as deployment and clock synchronization of wireless sensor networks - see, for example, the works of 
McGlynn and Borbash |28|, PalChaudhuri and Johnson [30 1, Moscibroda, Von Rickenbach and Wat- 
tenhofer [26J, or the survey paper of Sundararaman, Buy and Kshemkalyani [37]. The goal in these 
works is different from the classic problem of radio broadcast, i.e. avoiding interference. Here, the goal 
is instead to minimize the use of the radio for both transmitting and receiving, and for most of the time to 
shut the radio down completely, as the radio even in listening mode consumes a lot of energy. Somewhat 
surprisingly, in the theoretical community, this model has not been studied. 

We distill a clean theoretical formulation of this problem of minimizing radio use and present near- 
optimal solutions. Our base model ignores issues of communication interference, although we also ex- 
tend the model to handle this requirement. We assume that nodes intend to communicate periodically, 
or according to some time-based schedule. Clearly, perfectly synchronized devices could switch their 
radios on for exactly the minimum periods required by their joint schedules. The main challenge in the 
deployment of wireless networks is to synchronize the devices' schedules, given that their initial sched- 
ules may be offset relative to one another (even if their clocks run at the same speed). In this paper we 
study how frequently the devices must switch on their radios in order to both synchronize their clocks 
and communicate. In this setting, we significantly improve previous results, and show optimal use of the 
radio for two processors and near-optimal use of the radio for synchronization of an arbitrary number 
of processors. In particular, for two processors we prove deterministically matching (^/n) upper and 
lower bounds on the number of times the radio has to be on, where n is the discretized uncertainty period 
of the clock shift between the two processors. (In contrast, all previous results for two processors are 
randomized, using birthday paradox (e.g. |30|, |26J)). For m = n^ processors (for any positive /3 < 1) 
we prove i2(n'^^^^'''^) is the lower bound on the number of times the radio has to be switched on (per 
processor), and show a nearly matching (in terms of the radio use) 0{n^^^^^l^) randomized upper bound 
per processor, (where O notation hides poly-log{n) multiplicative term) with failure probability exponen- 
tially close to 0. For j8 > 1 our algorithm runs with at most poly-log{n) radio invocations per processor. 
Our bounds also hold in a radio-broadcast model where interference must be taken into account. Again, 
these results are asymptotically superior to all previously proposed algorithms that try to minimize radio 
use. 

While our upper bounds are fairly straightforward (once the problem is properly formulated, which 
we also consider to be an important contribution) the matching lower bounds (even for the case of 
two processors) and nearly-matching lower bound (for the case of multiple processors) are nontrivial. 
We believe the abstract model that we put forward here will be of independent interest and will spark 
additional theoretical research in this setting. 
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1 Introduction 



Motivation: Radios are inherently power-hungry. As the power costs of processing, memory, and other 
computing components drop, the lifetime of a battery-operated wireless network deployment comes to 
depend largely on how often a node's radio is left on. System designers therefore try to power down 
those radios as much as possible. This requires some form of synchronization, since successful commu- 
nication requires that the sending and receiving nodes have their radios on at the same time. Synchro- 
nization is relatively easy to achieve in a wired, powered, and well-administered network, whose nodes 
can constantly listen for periodic heartbeats from a well-known server. In an ad hoc wireless network 
or wireless sensor network deployment, the problem becomes much more difficult. Nodes may be far 
away from any wired infrastructure; deployments are expected to run and even to initialize themselves 
autonomously (imagine sensors dropped over an area by plane); and environmental factors make sen- 
sors prone to failure and clock drift. Indeed there has been a lot of work in this area, see for example: 
||3|5l|6llE[l2l[I3IMl2l]IMl2lllMl2S|2i[3a[3BE^ Many distinct problems are 

considered in these papers, and it is beyond the scope of this paper to survey all these works, however most 
if these papers (among other issues) consider the following problem of radio-use consumption: 

Informal Problem description: 

Consider two (or more) processors that can switch their radios on or off. The processors' clocks are not 
synchronized. That is, when a processor wakes up, each clock begins to count up from 0; however, processors 
may awake at different times. The maximum difference between the time when processors wake up is 
bounded by some parameter n. If processors within radio range have their radios on in the same step, they 
can hear each other and can synchronize their clocks. When a processor's radio is off, it saves energy, but 
can neither receive nor transmit. Initially, processors awaken with clock shifts that differ by at most n time 
units. The objective for all the processors is to synchronize their clocks while minimizing the use of radio 
(both transmitting and receiving). We count the maximum number of times any processor's radio has to be 
on in order to guarantee synchronization. Indeed, as argued in many papers referenced above, the total time 
duration during which the radio is on is one of the critical parameters of energy consumption, and operating 
the radio for considerable time is far costlier than switching radio off and switching it back on. We assume 
that all the processors that have their radios on at the same time can communicate with each other. The goal 
of all processors is to synchronize their clocks, i.e. to figure out how much to add to their offset so that all 
processors wake up at the same time. (We also consider an extension that models radio interference, where 
if more then one processor is broadcasting at the same time, all receiving processors that have their radio 
switched on hear only noise.) 

For multiple processors, we assume that all processors know the maximum drift n, otherwise the adversary 
can make the delay unbounded. It is also assumed that all processors know the total number of processors 
m, although, we also consider a more general setting where n is known for all processors, but m is not. In 
this setting, we relax the problem, and instead of requiring synchronization of all m processors, we instead 
require synchronization of an arbitrarily close to 1 constant fraction of all processors. In this relaxation 
of our model, we require that the radio usage guarantee holds only for those processors that eventually 
synchronize. 

Furthermore, our model assumes that all processors are within radio range of each other, so that the link 
graph is complete. Our techniques can be thought of as establishing synchronization within completely con- 
nected single-hop regions. Clearly, single-hop synchronization is necessary for multi-hop synchronization. 
Our single-hop synchronization protocol, with simple changes, can synchronize a connected multi-hop net- 
work in the sense that (1) two directly connected nodes know one another's clock offsets, and (2) given any 



two nodes in the network v and w, there exists a path vq = v, vi , . . . , v„ = >v where each adjacent pair of nodes 
is connected and synciironized. Thus our central concern in this paper is on establishing lower bounds and 
constructing nearly optimal solutions for the single-hop case. 

Towards Formalizing the Abstract Model: 

To simplify our setting we wish to minimize both transmit and receive cost (i.e., all the times when the 
radio must be "on" either transmitting or receiving). We discretize time to units of the smallest possible 
interval that allows a processor to send a message to or receive a message from another processor within 
radio range. We normalize the cost of transmitting and receiving to 1 unit of energy per time step. (In 
practice, transmission can be about twice as expensive as receiving. We can easily re-scale our algorithms 
to accommodate this as well, but for clarity of exposition we make these costs equal.) We ignore the energy 
consumption needed to power the radio on and to power it off, which is at most comparable but in many cases 
insignificant compared to the energy consumption of having the radio active. This is the model considered, 
for example, in |[28l l26l \5\ |6l [34l [3T1- For the purposes of analysis only, we assume that there is global 
time (mapped to positive integers). All clocks can run at different speeds, but we assume that clock drifts 
are bounded; i.e., there exists a global constant c, such that for any two clocks their relative speed ratio is 
bounded by c. Now, we define as a time "unit" the number of steps of the slowest clock, such that if two of 
the fastest processors' consecutive awake times overlap by at least a half of their length according to global 
time, then the number of steps of the slowest clock is sufficient time for any two processors to communicate 
with each other. (See appendix for further discussion.) We formahze this more precisely in the definition of 
our model. 

Our Formal Model and Problem Statement: 

Global time is expressed as a positive integer, m processors start at an arbitrary global time between 1 
and n, where each processor starts with a local "clock" counter set to 0. The parameter n refers to the 
discretized uncertainty period, or equivalently, to the possible maximal clock difference, i.e., to the maximal 
offset between clocks; hence, we will use these terms interchangeably. Both global time and each started 
processor's clock counter increments by 1 each time unit. The global clock is for analysis only and is not 
accessible to any of the processors, but an upper bound on n is known to all processors. Each processor 
algorithm is synchronous, and can specify, at each time unit, if the processor is "awake" or "sleeping." (The 
"awake" period is assumed to be sufficiently long to ensure that the energy consumption of powering the 
radio on and then shutting it off at each time unit is far less than the energy expenditure to operate the radio 
even for a single time unit). All processors that are awake at the same time unit can communicate with 
each other. (Our interference model changes this so that exactly two awake processors can communicate 
with each other, but if three or more processors are simultaneously awake, none of them can communicate.) 
The algorithm can specify what information they exchange. The goal is for all m processors to adjust their 
local clocks to be equal to each other, at which point they should all terminate. The protocol is correct if 
this happens either always or if the protocol is randomized with probability of error that is negligible. The 
objective is to minimize, per processor, the total number of times it's radio is awake. 

We remark that the above model is sufficiently expressive to capture a more general case where clocks at 
different nodes run at somewhat different speeds, as long as the ratio of different speeds is bounded by a 
constant. We formally prove this fact in the appendix (see the first section in the appendix). 

Our Results: 

We develop algorithms for clock synchronization in radio networks that minimize radio use, both with and 
without modeling of interference. In particular, our results are the following. 



1. For two processors we show a D.{^/n) deterministic lower bound and a matching deterministic 0{y/n) 
upper bound for the number of time intervals a processor must switch its radio on to obtain one-hop 
synchronization. 

2. For arbitrary m = n^ processors, we prove Q.(^'^^ is the lower bound on the number of time inter- 
vals the processor must switch its radio for any deterministic protocol and show a nearly-matching (in 
terms of the number of times the radio is in use) O (^n^ • poly-log randomized protocol, which 
fails to synchronize with probability of failure exponentially (in n) close to zero. Furthermore, our up- 
per bound holds even if there is interference, i.e., if more than one processor is broadcasting, listening 
processors hear noise. 

3. It is easy to see that processors could not perform synchronization if n is unknown and unbounded, 
using standard evasive argument. However, if n is known, we show that | (or any other constant 

fraction) of the processors can synchronize without knowing m, yet still using 0{n~^ • poly-log («)) 
radio send/receive steps, with probability of failure exponentially close to zero. 

We stress that while the upper bound for two processors is simple, the matching lower bound is nontrivial. 
This (with some additional machinery) holds true for the multi-processor case as well. 

Comparison with Previous (Systems) Work: 

Tiny, inexpensive embedded computers are now powerful enough to run complex software, store significant 
amounts of information in stable memory, sense wide varieties of environmental phenomena, and commu- 
nicate with one another over wireless channels. Widespread deployments of such nodes promise to reveal 
previously unobservable phenomena with significant scientific and technological impact. Energy is a fun- 
damental roadblock to the long-lived deployment of these nodes, however. The size and weight of energy 
sources like batteries and solar panels have not kept pace with comparable improvements to processors, and 
long-lived deployments must shepherd their energy resources carefully. 

Wireless radio communication is a particularly important energy consumer. Already, communication is ex- 
pensive in terms of energy usage, and this will only become worse in relative terms: the power cost of 
radio communication is fundamentally far higher than that of computation. In one example coming from 
sensor networks, a Mica2 sensor node's CCIOOO radio consumes almost as much current while listening for 
messages as the node's CPU consumes in its most active state, and transmitting a message consumes up to 
2.5 times more current than active CPU computation |[33l . In typical wireless sensor networks, transmitting 
is about two times more expensive than listening, and about 1.5 times more expensive than receiving, but 
listening or transmitting is about 100 times more expensive as keeping the CPU idle and the radio switched 
off] (i.e., in a "sleep" state). 

Network researchers have designed various techniques for minimizing power consumption l5l|6l|34l- For 
example, Low-Power Listening OTl trades more expensive transmission cost for lower listening cost. Every 
node turns on its radio for listening for a short interval T once every interval n > T. If the channel is quiet, the 
node returns to sleep for another n; otherwise it receives whatever message is being transmitted. To transmit, 
a node sends a preamble of at least n time units long before the actual message. This ensures that no matter 
how clocks are offset, any node within range will hear some part of the preamble and stay awake for the 
message. A longer n means a lower relative receive cost (as r/n is smaller), but also longer preambles, and 
therefore higher transmission cost. 

'Example consumption costs: CPU idle with clock running and radio off ("standby mode"), 0.1-0.2 mA (milliamps); CPU on 
and radio listening, 10 mA; CPU on and radio receiving, 15 mA; CPU on and radio transmitting, 20-25 mA. 



A more efficient solution in terms of radio use was proposed by PalChaudhuri and Johnson f30l, and further 
by Moscibroda, Von Rickenbach and Wattenhofer l.26il . The idea is as follows. Notice that in the proposal 
of [,31] . the proposal was for a transmitting processor to broadcast continuously for n time units, while 
receiving processors switch their radios on once every n time units to listen. Even for two processors, this 
implies that total use of the radio is « + 1 time units (i.e., it is linear in n). The observation of |[30ll26i is 
that we can do substantially better by using randomization: if both processors wake their radios 0{y/n) time 
units at random (say both sending and receiving), then by birthday paradox with constant probability they 
will be awake at the same time and will be able to synchronize their clocks. (As indicated before, we show 
instead a deterministic solution to this important in practice problem, and a matching lower bound.) 

Comparison with Radio Broadcast: 

Usually, in a broadcast setup, a node is able to receive a message if and only if it does not transmit, and 
there is one and only one of its neighbors that transmits, at that time. In the case when nodes are not 
able to detect collision, 131 13 showed randomized protocols. A deterministic broadcast algorithm, with 
work time 0{n^^l^), has been given in [10|. The improvements of these algorithms have followed |22j|: 
for undirected radio network graphs, with diameter D, for randomized broadcast the expected work time 
has been 0{D\og{n/D) +\o^n), while for deterministic broadcast the expected work time has been 
^(" iog'(n/'g) )• ll23l . a faster algorithm for directed radio network graphs has provided running time 
0{n\ogn\ogD). Additionally, other algorithms for broadcast ifTTl [151 [TtI [T9l |2T1 as well as for clock syn- 
chronization |[29l [32l [T2I m have been proposed. The work of radio broadcast is different from the problem 
we address at this paper. However, as we mention in the technical description, once we resolve the problem 
of meeting times, we can easily combine our solutions with radio broadcast goal to avoid interference. 

High-level ideas of our constructions and proofs 

• For two processor upper bound, we prove that two carefully chosen affine functions will overlap no 
matter what the initial shift is. The only technically delicate part is that the shift is over the reals, and 
thus the prove must take this into account. 

• for the 2-processor lower bound, we show that for any two strings with sufficiently low density (of 
I's) there always exists a small shift such that none of the I's overlap. This is done by a combinatorial 
counting argument. 

• for multiple processors, the idea of the lower bound is to extend the previous combinatorial argument, 
while for the upper bound, the idea is to establish a "connected" graph of pairwise processor synchro- 
nization, and then show that this graph is an expander. The next idea is that instead of running global 
synchronization, we can repeat the same partial synchronization logarithmic number of times (using 
the same randomness) to yield a communication graph which is an expander. We then use standard 
synchronization protocol over this "virtual" expander to reach global synchronization. 

• for handling interference, we observe that standard "back-off protocols |ili|91 can be combined with 
previous machinery to achieve non-interference, costing only poly-logarithmic multiplicative term. 

• for the protocol that does not need to know m, (recall that m is the total number of processors within 
radio-reach), we first observe that if m > n, by setting m = n our protocol already achieves synchro- 
nization with near-optimal radio use. The technical challenge is thus to handle the case where m <n 
but the value of m is unknown to the protocol. Our first observation is to show that processors can 
overestimate m, in which case the amount of energy needed is much smaller (per processor) than for 
smaller m, and then "check" if the synchronized component of nodes has reached current estimate on 



m. If it did not, than our current estimate of m can be reduced (by a constant factor) by all the proces- 
sors. To assure that estimates are lowered by all the processors at about the same time, we divide the 
protocol into "epochs" which are big enough not to overlap even with a maximal clock drift (of n). 
Summing, the energy consumption is essentially dominated by the smallest estimate of m, which is 
within a constant factor of correct value of m, and all processors that detect it stop running subsequent 
(more expensive) "epochs". 

2 Mathematical Preliminaries 

Lemma 1 (Two-Color Birthday Problem) For any absolute constant C > a/1 — InO.l « 1.8173 and any 

positive s,t (0, 1), where s + t = 1, the following holds: r = Cn" identical red balls and b = Cn^ identi- 
cal blue balls are thrown independently and uniformly at random into n bins. Then, for sufficiently big n, 
probability that there is a bin with both red and blue balls is > 0.8. 

Proof Outline 

We first note that since all balls are thrown independently and uniformly at random, it follows that throwing 
all of r + b balls together uniformly at random, is equivalent to the scenario of first throwing r red balls, 
then throwing b blue balls. Thus, we first throw r red balls, and count the number of unoccupied bins. Let us 
mention that we keep s and t fixed (while n grows). Using the Theorem 1 in [20J the number of unoccupied 
bins is &{ne^''^") with probability > 1 — exp(— n*). Then we throw b blue balls u.a.r. into n bins. Given that 
the number of unoccupied bins after throwing red balls is &{ne^''^"), with probability aforementioned, it 
follows that every blue ball independently hits a bin with a red ball with probability > 1 — @{ne^''^").Novj, 
applying the Chernoff bound on the number the blue balls hitting the bins with the red balls, the proof of the 
lemma follows. The formal proof is given in the Appendix. 

3 Lower Bounds 

Recall that n is the maximum offset between processor starting times and m = n^ is the number of processors. 
Assume that each processor runs for some time L. Its radio schedule can then be represented as a bit string 
of length L, where the ith bit is 1 if and only if the processor turned its radio on during that time unit. We first 
consider the two-processor case. Recall that in our model maximal assumed offset is at most n. If we take 
2 bit strings corresponding to the two processors, the initial clock offset corresponds to a shift of one string 
against the other by at most n positions. Note that if we set L > An, the maximal shift is at most n < L/4. 

Note: In the next sections, without loss of generality we apply ceiling function to any real number, e.g., L", 
L/C^ are treated as [L"], [L/C^], respectively. 

To prove our lower bound, we need to prove the following: for any two L-bit strings with at most \/L/C 
ones in each string (for some constant C > l/\/2), there always exists a shift < L/4 of one string against 
another such that none of the ones after the shift in the first string align with any of the ones in the second 
string. In this case we say that the strings do not overlap. W.l.o.g., we make both strings (before the shift) 
identical. To see that this does not limit the generality, we note that if the two strings are not identical, we 
can make a new string by taking their bitwise OR, what we call the "union" of strings. If the distinct strings 
overlap at a given offset, then the "union" string will overlap with itself at the same offset. 



Lemma 2 (Two Non-Colliding Strings) For any absolute constant C > 1/ \/2, and for every L-bit string 
with I < ^ ones, there is at least one shift within L/(2C^) such that the string and its shifted copy do not 
overlap. 

Proof For a considered L-bit string, let us denote the positions of I ones a.?, \ < a\ < a2 < ■ ■ ■ < ae < L. 
Let us consider the set of differences Si = {aj — at : I < i < j < i}. The cardinality of that set is at most 
1^1 < (2) = ^(^— l)/2 < L/(2C^) (some of differences may be equal). All differences are greater than zero 
since a, are mutually different, and S C {1,2, . . . ,L — I}. Let d G {1,2, ... , [j^?!} minimal integer 

such that d ^ S. That integer exists since \ S\ < [ 2§il ■ ^'^^ "^^e set of positions of ones A = {ai,a2, ■ ■ ■ ,ai}, 
let the shifted set be A + 5 = {a,- + 5\ai e A}. The sets A and A + 5 do not intersect since 5 > 1, that is 
a,- / a,- + 5, and by construction, since 5 ^ ^ it follows a, aj + 5 for any a,-, ay G A. This proves the 
lemma. ■ 

Next, we want to prove a general lower bound for multiple strings. The high-level approach of our proof is 
as follows. We pick one string, and then upper bound the total number of ones possible in the "union" of 
all the remaining (potentially shifted) strings. If we can prove that assuming the density of all the strings is 
sufficiently small, and there always exists a shift of the first string that does not overlap the "union" of all 
the remaining strings, we are done. The "union" string is simply a new string with a higher density. 

Lemma 3 (General Two Non-Colliding Strings with Different Densities) Let s,t >0 such that s + t < 1, 
and let C > 1. For two L-bit strings such that the number of ones in the first string is a = L" /C, and the 
number of ones in the second string is b = L' /C, there is a shift within L/C^ + 1 such that the first string and 
the shifted second string do not overlap. 

Proof Let the positions of ones in the first string be P = {p\,p2,...,Pa}, and the positions of ones in the 
second string be Q = {qi,q2, ■ ■ ■ ,qb}. Let us consider the set of differences S = {p — q \ p £ P,q £ Q}. The 
cardinahty of that set satisfies \S\ < \P\\Q\ = ab = U+* /C^ < L/C^- < L. 

Similarly to the proof of the Lemma|2l let us choose 5 G {0, 1,2, ... , [L/C^] + 1} such that 5 ^ S. That 
integer exists since \S\ < L/C'^. Now, it follows that P and Q-\- 8 = {q-\- 5 \ q ^ Q] Ao not intersect, since 
by construction p ^ q + 5 for any /? € P and q £ Q. This proves the lemma. ■ 

Here, w.l.o.g., we considered only "left" shift. If we needed both left and right shift, then we would have an 
additional factor of 2, and would need both p — q and q — p in S. The proof of the previous lemma would 
follow by the same argument. Using the Lemma|3l the lower bounds immediately follow. 

Theorem 4 There exists an absolute constant C > 1, such that for any n^ strings of length L with at most 
^(i-/^)/2 in each string, there always exists a set of shifts for each string by at most L/4 such that no 
string's ones overlap any of the ones in all the other strings. 

Proof Set a = (l — j8)/2. Add strings sequentially and for each find a shift that does not overlap with (the 
union of) all the shifted previous strings. The Lemma [3] applies since the smaller string has density n", and 
the "union" of all the previous strings has density of at most n^ -n"". This density is at most n, since the sum 
of the exponents a + j8 = (l+j8)/2isat most 1, what proves the theorem. ■ 



4 Matching Upper Bound for Two Processors 



We now show the upper bound. That is, we give the deterministic algorithm for two devices. In particular, 
for any initial offset of at most n, we show a schedule where two processors meet with probability equal to 
one inside a "time- window" of length W = 2n + 4a/?i + 2. 

Theorem 5 For any n, there exists a string of length W = 2n + \^ + 2 with at most 2y^ + 2 ones such 
that this string will overlap itself for all shifts from 1 to n. 

Proof Let us define the string S of the length W , that has ones at the following positions (from the 
perspective of its local clock): Set the bits at positions {i^/n + i) and {i^/n) to 1, for / G {1, ... , [2y^ + 2J }; 
Set the remaining bits to 0. 

For the analysis, we consider the "global" clock. Further, we consider two strings A and B, being the shifted 
versions of the string S, shifted by a^ and b^, respectively. (Both a^ and are < n, by the conditions of the 
Theorem [5]) Since the string S is deterministically defined, we know the exact appearances of ones in the 
strings A and B. Thus, from the global clock point of view, in the strings A and B, respectively, ones appear 
during the following time intervals: 

[[ai,\, [a,-, J + 1], [La,-J , + 1] and [[^-J , [^-J + 1], [[^2] , [K\ + 1]> 
where the values a,-, , a,^ , bi^ , bi^ are given by 

= ao + i\/n + i, 
^i^i = + i^/n 
bi,i = bo + iy/n + i, 
bi,2 = h) + iVn, 

for / e { 1 , . . . , [2y/n + 2J }. The initial values of strings are: ao.i = fl'0,2 = and bo^i = bo,2 = bo- We show 
that there exist integers i,je{0,l,..., [2y^+2J}, such that for some 51,^2 £ {1,2} the following is satisfied 

5 = -fty,,V2| < 1. (1) 

Although the schedule we propose may look simple, in general i/i is not an integer, thus we have to perform 
the precise analysis below. 

We will widely use the following property of the floor function and the fractional part for the reals. For any 
x&'R,x= [x\+ {x}, where {x} G [0, 1) is the fractional part, and [x\ G Z is the floor function, the following 
is satisfied 

[.xj - 1 < X- 1 < [xj < X < [xj + 1 < x+ 1. 

Now, we explicitly construct ij such that Eq. ([T]) is satisfied, that is, a,-,s, = bj^s^_ it 5 for some fractional 
part 5 G [0, 1) and for some si,S2 G {1,2}. Let us call the absolute difference between initial values of the 
strings A = |ao — ^o|- Since the protocol is symmetric in A and B, w.l.o.g., let us first consider ao — bo> 0. 
From ao,bo G {0, 1, . . . ,n}, it follows A<n. We consider the following three cases. 

Case: ao — ^0 = 0. It follows ao = bo and Eq. ([T]) is satisfied for / = j = 0. 



Case; 



^ <ao — bo<n. Let q = ■ Then we have: 



< 



A 



< 



A n r- 



+ 1, 



q < -^<q+l, 

q^/n < A<q\/n + \/n. 

Now we give the exact / and j where the strings meet. Consider / and j given by 

i = L(? + l)v^J-A, 
j = i + q+l. 



Then we have: 

= [{q + l)^/n\ — A< {q + ^)^/n — A = q^/n — A + ^/n < ^/n, 
j = q+\ + i<^/n + \ + ^/n = 2^/n + \. 

From above, it follows that since A is an integer, A < [{q+l)\/n\ ^ / > ^ j >0. Substituting the values 
for i and j, we get 5 = |a,-,i — bj^2\ = {{q+ ^ [Oj !)• 

Case: —n<ao — bo<—l. In this case, for the previously defined i,j,A, it follows \aj^2 — bi,\ \ € [0, 1). 

Finally, for any ao,^o £ {0) 1, • • • there are /,y such that the shifted strings meet. Since maxij{ai,bj} < 
n + (2-y/« + 2) + (l-^n + 2) = 3n + 4 + 2, and subtracting n, the length of the strings, it follows that 
the strings meet with the probability equal to 1 inside the time- window of the length W = 2n + 4-y/n -|- 2. 
This completes the proof. ■ 



5 Upper Bound for m Processors 

In this setting we have m = nP processors (and as before the maximum shift is at most n). We first state our 
theorem: 

Theorem 6 There exists a randomized protocol for n^ processors ( which fails with probability at most 
1/2°^"^} such that: 

1. ifP< \ the protocol is using at most o{n~^ ■ poly-log (n)^ radio steps per processor, and 
2- if P >i using at most O (poly-log (n)) radio steps per processor 



Furthermore, the same bounds hold for the synchronization in the radio communication model, where a 
processor can hear a message if one (and strictly one) message is broadcasted. 



First, we give a high-level outline of the construction of our algorithm for j3 G [0, 1). For the case of j3 > 1 
we only need Steps 4 and 5, see below. The formal analysis and proofs of the Main Algorithm are given in 
the Appendix. 

Outline of the Main Algorithm: 

Step 1. We let each processor run for L = 4n steps, waking up during this time O (^"^) times uniformly at 
random. It is important to point out that each processor uses independent randomness. We view it as m- 
row and L {L>W + n) column matrix A (taking into account all the shifts), where W = 2n + A^fn + 2 
is defined in Theorem [5] Fix any row of this matrix (say the first one). We say that this row "meets" 
some other row, if 1 in the first row also appears (after the shifts) in some other row. If this happens, the 
first processor can "communicate" with another processor. We show that for a fixed row, this happens 
with a constant probability. 

Step 2. Each processor repeats Step 1 (using independent randomness) C?(logm) times. Here, we show that 
a fixed row has at least O(logm) connections to other rows (not necessarily distinct) with probability 
greater than 1 — 1 /poly(m). 

Step 3. From Step 2. we conclude that the first row meets at least a constant number of distinct other rows 
with probability greater than 1 — 1 /2m. 

Step 4. We use the union bound to conclude that every row meets at least constant number of distinct other 
rows with probability greater than 1/2. If we repeat this process a logarithmic number of times, we 
show that we get an expander graph with overwhelming probability (for the definition of an expander 
see 1271 ). Thus, considering every row (i.e., every processor) as a node, this represents a random 
graph with degree of at least at least constant number for each node, which is an expander with high 
probability. 

Step 5. During the synchronization period, a particular processor will synchronize with some other processor, 
without collision, by attempting to communicate whenever it has a 1 in its row. (In case of interference, 
the processor can communicate if only one other processor is up at this column, which we can achieve 
as well, using standard "back-off' protocols (11 0, costing only poly-logarithmic multiplicative term.) 

Step 6. The processors can now communicate along the edges of the formed expander (which has logarithmic 
diameter) as follows. The main insight that we prove below is that if processors repeat the same ran- 
dom choices of Step 1 through Step 5, the communication pattern of the expander graph is preserved. 
Hence, the structure developed in Step 2 can be reused to establish a logarithmic-diameter (in m) 
spanning tree and synchronize nodes with 7?o/y-/ogarithmetic overhead (using known machinery over 
this "virtual" graph). We show in the appendix, using standard methods, that communicating over the 
implicit expander graph to synchronize all nodes can be done in D + 2 steps, where D is the diameter 
of the expander. 



6 Protocol That Does Not Need to Know the Number of Processors 

Suppose our processors know the offset n but not the number of all processors in the system, that is, m. The 
main observation here is that once we make a spanning tree of the graph, each node can also compute the 
number of nodes in its spanning tree. Hence, we can make an estimate of m and then check to see if this 
estimate is too big. Thus, until the right (within a constant factor) estimate is reached, all nodes will reject 



the estimate and continue. Adjusting constants appropriately, we can guarantee that an arbitrary constant 
fraction of the processors will terminate with the right estimate of m (within some fixed constant fraction). 
The algorithm for the estimation of m is as follows. 

Algorithm: Estimation of m 
El. Setj = 0. 

E2. Build a spanning tree using the Main Algorithm (from the previous section) for m,- = n/2' and count 
the number of nodes in the tree. If the number of the nodes in the tree is less than m, then set i:=i+\ 
and go to step E2. 

E3. Output m,-. 
End of Algorithm: Estimation of m 

Theorem 7 Any constant fraction of the processors can synchronize without knowing m, yet still use 
0{n~ poly-log (n)) radio send/receive steps (with probability of failure exponentially close to zero). The 
bound on the radio use holds only for processors that synchronize. 

i-ft 

Proof We showed that the used power is 0{n^~ poly-log {n)) for a particular number of processors m, = 
n^'. Let us consider m,- = n/2'. Since a, = (1 —^i)/2, it follows n"' = ^Jnjmi = 2'^^. Letimwc = \iog{n/m)]. 
Then the total power, used in the protocol that does not know m, is 

hnax ^max ^max 

£ 0(n«'poly-log(n)) = £ 0(2'/2poly-log(n)) = £ 2'/2o (poly-log (n)) 

(=0 (=0 i=0 

= 0(2'"-/2)(5(poiy.iog(n)) = 0(n(i-^)/2poly-log(n)), 
since 2'"""/^ = O^^fnjm) = 0{n^). ■ 



7 Conclusions and Open Problems 

In this paper, we have studied an important problem of power consumption in radio networks and com- 
pletely resolved the deterministic case for two processors, showing matching upper and lower bound. For 
multiple processors, we were able to show a poly-logarithmic gap between our randomized protocol and 
our deterministic lower bound. However, this is not completely satisfactory. Our lower bound holds only 
for deterministic protocols, while our upper bound in multi-processor case is probabilistic (unlike the two- 
processor case, where our upper bound is deterministic as well). Closing this gap remains an interesting 
open problem. 

Another interesting question is the following. It is important to note that in radio communication, conserva- 
tion of power can be achieved in two different ways: one approach is to always broadcast the signal with the 
same intensity (or to power down radios completely in order to save energy); this is what we explored in this 
paper. The second approach is the abihty for a radio to broadcast and receive signals at different intensity, 
the stronger the signal the further it reaches. In the case where all processors are at the same distance from 
each other, this is a non-issue (i.e., our single-hop networks, the main focus of this paper). However, for 
multi-hop networks the question of optimal power-consumption strategies with varying signal strength is 
completely open. 
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Appendix 

A Proof of Lemma [I 

Proof We first note that since all balls are thrown independently and uniformly at random, it follows that 
throwing all r + b balls together uniformly at random, is equivalent to the scenario of first throwing r red 
balls, then throwing b blue balls. Thus, we first throw r red balls, and count the number of unoccupied bins. 
Let Z be a random variable, denoting the number of empty bins, after throwing r = Cn" balls into n bins, 
u.a.r. The expectation of Z is 

n 

By the occupancy bound. Theorem 1 in ||20l . for any > 0, the tail of Z is given by 



Pr 



[|Z-E[Z]| > 0E[Z]] <2exp(^- 



02E[Z]2(n- l/2)\ 



-E[Z]2 



Now, let us throw b blue balls into n bins, u.a.r. We have that some of these n bins have been previously 
occupied with red balls. Let us denote the event that exists at least one bin with both red and blue balls. 
(The goal is to show Pr[J^] > 0.8 for the assumptions given in the lemma.) Given Z = z, the probability 
that one blue ball does not hit a bin with a red ball is z/n. Then it follows Vr[M'\Z = z] = 1 — {zjnf , and 
furthermore, 

Pr[^|Z < (1 + e)E[Z]] > 1 - ((1 + e)E[Z]/?^)^ 

where we will appropriately chose d = o(l), later. We now use the formula of the total probability, and the 
bound on Z, to obtain a lower bound on the probability of M' 

Pr[J^] = Pr[^|Z< (l + 0)E[Z]lPr[z< (1 + 0)E[Z] 



+ Pr 
> Pr 



|Z > (1 + 0)E[Z] Pr Z > (1 + 0)E[Z] 
|Z< (l + 0)E[Z]lPr[z< (1 + 0)E[Z] 



>- ('H(..a)E,z,/„)0(i-..p(-^^f!;|^)) 

. (,-„.efa-ir)(.-.e.p(-«g/?))) 

Let us choose = 1/fc = 1 /{Cn'). The goal is to obtain an lower bound on Pr[Jf ], sufficiently close to 1, 
thus we discuss the following terms. First, 



i-(i + er(i 



n b 



> \-e 



i-c^ 



where we have used (1 + <e and (1 — 1/n)" < 1/e, for every n. (Furthermore, the sequences (1 + 
1 /n)" and (1 — 1 /«)" are both increasing, with the hmits e and \/e, respectively.) Second, let us consider the 
term ^ ^z-e"^^^^^ ' following: (i) n — 1/2 > n/2, for any positive integer n; (ii) Bernoulli's 

inequality (1 + > 1 + rx forx > — 1 and r>\. Thus, 



e^E[Z]^(n-l/2) 
n2-E[Z]2 



> 



> 



(n/E[Z])2-l 
e'^n {\-\/nf' 
2 l-(l-l/n)2'- 
02„ (1 - \/n 



using the expression for E[Z] 



\2r 



2 2r/n 
n {l-l/nf'- 



by Bemoulh's inequahty 



we already chose 6 



2b^ 2r/n 

= (1 ~ 1 7")^'^ t)y definition for r and Z?, it follows rb = C^n 
In order to find an lower bound on the last expression, let us consider the logarithm of it. 



That is. 



= -ln(4C'*) + hir + 2rhi(l-l/n) 

> -ln(4C'*) + lnr + 2r(-2/n) by the fact hi(l - i) > for n > 2 

= -ln(4C'^) + lnr- — 
n 

4C 

= — ln(4C )+jlnC + jlnn from definition r = Crf 

> ^ Inn for sufficiently large n 



> l-exp(-n^/2)_ 



(2) 



Thus, for sufficiently large n, we obtain the lower bound on the probability of 
Pr[Jif] > (l-e^-^')(l-2exp(-?i-^/2)) 

= l-e^-^' -2exp{-n'^^)+2e^-^'exp{-n'/^). (3) 

For given £ G (0, 1), if both e/2> e^^^' and £/4 > exp(-?i*/2), then Eq. © imphes Pr[^] > 1 - £. These 
two conditions are equivalent to 

and 

Finally, we obtain 

Pr[^] > l-£. 

Specifically, let C > -InO.l w 1.8173, that is 1 - e^^^~ > 0.8. Then for sufficiently big n, it follows 
Pr[J^] > 0.8, which completes the proof of lemma. ■ 



B Proof of Correctness of the Main Algorithm 



Proof Analysis of Step 1. We generate a random matrix A as follows. 

Definition 8 (Generation of tlie Random Matrix A) For each row of the random matrix A, inde- 

pendent of the content of other rows, we uniformly and independently generate CL"" integers ,f2, • . • ,fcL« S 
{1,2, .. . ,L}, where a, j8 G [0, 1]. Each ti corresponds to one energy unit for the unit time tj, of that row (note, 
ti's are not necessarily different, and the sum of each row is CL'^). 

Lemma 9 Let A be the matrix (given by the Definition [S]), such that 2a + j3 = 1. Let us consider one 
particular row from A. That row "meets" with some other row with the probability > 0.8. 

Proof In the Section [3l we used L = An. Each row in the matrix A has CL" = Cn" ones, denoting the 
constant C = 4"C. 

Let us consider a particular row of the matrix A, w.l.o.g. let it be the first row. The first row has C?i" ones, 
what we call the blue balls. Let all of the remaining n^ — \ rows be collapsed into one row, what we call the 
"collapsed row", and each entry of this collapsed row will represent one of n bins. The number of ones in 
the collapsed row is 

(«/^-l)Cn«=n«+'^C(l-^). 

nP 

The ones in the collapsed row, we call the red balls. 

Since the positions of the balls in each row are generated randomly, independently and uniformly, also row 
by row independently, it follows that the balls in the collapsed row are generated independently, uniformly 
at random. That is, the process of throwing the first C«" red balls into "collapsed bins", then the second 



Cn^ (red) balls, and so onnP — \ times, is equivalent to throwing n"^^C{\ — /rfi) red balls into n bins, 
independently, uniformly at random, all at once. Now, by the Lemma [T] (Two-Color Birthday Lemma) it 
follows that for a + (a + j8) = 2a + j8 = 1, that particular (the first) row "meets" with some other row with 
the probability > 0.8. ■ 



Analysis of Step 2. We repeat Oilogm) times the procedure "Generation of the random matrix A." Such 
constructed matrices concatenated to each other form the matrix A of the dimension x ©(Llogm). 

For the sake of brevity, let us call M ■.= m — \. Let us generate / = KXogM random matrices A^^' , . . . ,A'^^\ 
where A' is a constant to be determined later in the analysis of the Step 4. That is, let us repeat / times the 
procedure "Generation of the random matrix A." We prove that each row in the matrix A has 0.4A'logM 
"meetings" with the probability > 1/2. Again, w.o.l.g. let us consider the first row. Let X,- be a random zero- 
one variable, indicating that the first row "meets" with some other row in the matrix A^'', for / = 1, . . . , /. The 
variables Xj are independent Bernoulli trials, since matrices A''' are generated independently. By the Lemma 
|9]it follows Pr[X,- = 1] > 0.8. Let T = L'^i^,- be the number of these "meetings". The following is satisfied 

E[r] = £E[X,] > 0.8/ = Q.%K\ogM. 

i=l 

Applying the Chernoff Bound on T it follows, 

Pr[r < (1 - 5)E[r]] < exp(-E[r]5V2) < 
Taking 5 = 1/2 we have the following bound on T 

Pr[r < OAKXogM] < Vr[T < E[r]/2] < g-O.i^^iogM ^ ^-o.i^:_ 

Analysis of Step 3. Here, we prove that each row in the matrix A "meets" with at least a constant number 
of different rows with the probability > 1 /2. We specify this constant later. (Furthermore, the meetings are 
chosen independently with replacement.) Again, w.o.l.g. we consider the first row. We have that the number 
of "meetings" is T > 0.4ii:iogM with probability at least 1 - M""-'^. 

We see this experiment as the throwing of T balls, one by one independently, into M bins. Let F,- be a binary 
random variable, such that, 7, = 1 if and only if the number of already occupied bins is increased by one, 
with the /th thrown ball, otherwise 7, = 0. Note that the variables Yi are not independent, so that we cannot 
apply the Chernoff bound. We have Pr[y;- = 1] > 1 - (/ - 1)/M > 1 - T/M, for / = 1 , . . . , T. Let us denote 
Pr[F; = 1] = Pi and Pr[y;' = 0] = qt. The following is satisfied pi < 1, and qi < T /M for every /. Let the 
value Y = YJ^i Yi be the number of occupied bins after throwing all T balls. 

At this step, we specifically choose the number 10, to be a constant number of different rows that one row 
in A "meets". We do that wlog, since the proofs below apply, by simple changing the number 10 with any 
other positive constant, and taking T to be that constant plus 2. 

Now we show that Pr[F < 10] < M"^ 

10 10 

Pr[y < 10] = £ £ n^'- U ^J<L I (T/M)^-' 

k=Ql:lG[T],\I\=kieI je[T]\I k=Q I:Ic[T],\I\=k 

10 /T^X 10 rrT T^T 10 1 

/t=0 V/ k=0'^-^^^ -"^ k=Q'^- 



For T > 0.4i«:iogM > 12 it follows 



eT^/M^-^^ = exp(l + TlogT - {T - 10)logM) < exp(-21ogM) = M"^, 
what completes the proof of the Step 3. 

Analysis of Step 4. Taking K such that O.IA' > 1 and KlogM > 30, by the union bound, from the analysis 
of the previous steps, it follows that 

Pr[the first row has at least 10 different "meetings"] > 1 - (1/M°-^^ + 1 /M^) 

1 

^ ^~2(M+1) 
^ 1 
2m 

Finally, by the union bound applied over all of m rows, it follows that every row has at least 10 different 
"meetings" with probability > 1 /2, that is, 

Pr [every row has at least 10 different "meetings"] > 1/2. 

The matrix A uniquely defines a random graph G, for which we show that has a minimum degree is at least 
10, with the probability > 1/2. Here we define the undirected graph G = {V,E) that corresponds to the 
random matrix A. 

Definition 10 For a graph G = {V,E), the set of nodes V = {vi,V2, . . . ,v,„} corresponds to the set of rows 
of the matrix A (i.e., to the set of devices). For \ < i < j < m the edge {i,j) E E, if and only if there is a 
column t in the matrix A such that Ai^t 7^ and Ajj ^ (I <t <n). 

Let G = {V,E) be the graph with the set of nodes V and the set of edges E = U-^jS*^'' obtained as the union 
of edges corresponding to the matrices A^'\ for / = 1, . . . ,/. We have just proven that with probability > 1/2 
every vertex in the graph G has degree > 10. Finally, we note that we can repeat this entire process another 
poly-log(?2) times to guarantee a success probability exponentially close to 1. 

Analysis of Step 5. A fixed row in a matrix A will meet with some other row, without collision, with 
probability > 1 - 0.4®(i°g'") = \-m-®^^\ In case of interference, use standard back-off protocol analysis, 
with O(log^m) multiplicative overhead. 

Analysis of Step 6. We recall that if we have a random graph with node-degree at least a constant, then we 
can use the following theorem: 

Theorem 11 (Bollobas, de la Vega fT^) A random d-regular graph on m nodes has diameter (logm + 
loglogm)/log((i — 1) +c, for same small constant c < 10. This is the best possible since any d-regular 
graph has diameter at least log m/ log (t/ — 1). 

In the graph G each node has a degree of at least d (we have specifically chosen, wlog, d to be 10). Further- 
more, by our construction, the edges are independent. It follows that the diam{G) is at most the diameter of 
a random d-vtg\x\w subgraph. That is, diam{G) < O(logm) with high probability. 



Definition 12 We generally say that an mx n zero-one matrix B = (bij) is associated to an undirected 
graph G = {V-jE) if and only if: the set of nodes is V = [m], and between two nodes i ^ j there is an edge 
{i,j) £ E if and only of if there is a column t £ [n] in the matrix B such that bi^ = bj^t = 1. ond bk,t = 0/or 
all k £ [n]\ {i,j}. We also say that the graph G is associated to the matrix B. 

Every processor / G [m] generates O(nlog^m) random variables C\,C2,.-- ^^oiniog^m)' ^'^P^^'^i'^g O(log^m) 
times the procedure (Generation of the Random Matrix A) (see Definition [8]l. That is, / randomly generates 
a string of the length n, with exactly = n^^^Py^ ones, while the rest of the entries are zeros. That string 
is mapped onto €[,€'2,. ■ ■ ,C'„. Then /, independent of the previous outcomes, repeats (Generation of the 
Random Matrix A ) for the next n variables C,*,^ j , C^_|_2 ; • • • > , and so on; totally repeating O (log^ m) times 
the procedure (Generation of the Random Matrix A). 

We define the zero-one matrix A, such that Ajj = C'j, for 1 < / < m, 1 < j < 0{n\o^ m); that is, the ith 
row corresponds to the coin outcomes of the ith processor. According to the way the random matrix A of 
the size m x 0(«log^m) is created, it can be divided into O(nlogm) blocks of the matrices A's, each of the 
size m x 0(?ilogm). Finally, each of these A's matrices, can be subdivided into 0(«logm) blocks of the 
matrices A's, each of the size m x n. For matrices A, A, A, let the associated graphs be G, G, G, respectively. 
In the Analysis of Step 2, we have proven that a particular row in A has at least 10 meetings with probability 
> 1/ (2m). That is, every node in G, has a degree of at least 10 with probability > 1 /2. Then it follows that 
every row in A has at least 10 meetings with probability close to 1 — m^®'^', i.e., every node in G has a 
degree of at least 10 with probability 1 — m^®^^\ Finally, let us define CommGraph. 

Definition 13 CommGraph is obtained by concatenating, one by one in time ofD+ 1 identical copies of A. 

The nodes will be able to communicate over CommGraph in time and synchronize their clocks' drifts. The 
synchronization scheme and the proof is given by the following 'Synchronization Algorithm.' ■ 



C Our Model Can Handle Different Clocks' Speeds with Bounded Ratio 

Here are the technical details that explain why our model is realistic even if processors have somewhat 
different clock speeds. For m processors, let their clock ticks be {ti , T2, . . . , Tm}. Let Tmin^'^max be minimum, 
maximum of the set T2, • • • , Tm}, respectively. The clock ticks are in general different, but the ratio 
'^max/'^min ^ c is boundcd by some constant c, and each processor knows that upper bound c. Let Tf^Qfif^ be 
the lower bound on the time necessary for the transmission, i.e., on the time necessary for communication 
and synchronization between two processors.lt is also assumed that the lower bound on Ttrans is known to 
all processors. Now, knowing c and Ttrans^ each processor / counts kj clock ticks as a single time step Sj such 
that ki is defined by si = kiXj > IXtrans- In other words, each processor enables the condition necessary for 
the communication, making its time step Si > IXtrans- It follows that if two processors / and j overlap for a 
period of time > ^mm{si,Sj}, then they can communicate. 

For the purposes of analysis only, we assume that there is a global time axis, and time is mapped to the set 
of non-negative reals. Note that there is no a real global time, i.e., neither processors know nor need a real 
global time clock. Now we define a "time unit" on the global time axis to be 5s„u,jf, and we call 5s,nax a 
single unit of the "global time." 



Fact 14 For every processor i that works within a single global "time unit" there are at least three complete 
"time steps" that this processor's radio is awake. 

Proof For the processor / that starts working at some time v,- G [0,^'!) within a single unit time u, the 
following is satisfied v; + 3 < 5, since we had previously defined the global time unit to be Ssmax- ■ 



Claim 15 If two processors i,j work within the same global time unit, then they can communicate and can 
synchronize. 

Proof Let us consider one time unit u (with the length of 5s,nax)- Let v,- S [0,^',) be the time where the 
processor / starts working within the unit u, and analogously let vj G [0,5y ) be the time where the processor 
J starts working within the unit u. We argue that if they both happen to be awake in the same time unit, there 
is an overlap of time > ^mm{si,Sj) when they both work and hence can communicate and can synchronize. 

The processors ij certainly start working at times v,- G [0,5,), Vj G [0,5'j), respectively, and then continue 
working over the period u. Let st > sj (the other case is symmetric). By the Claim[T5]both processors work 
at least for three full-time steps. Then it follows that there exist time instances a> b within the unit u such 
that: / works over periods [a — T/,a] and [a,a + T,]; and b & [a — Si,a] and j works for over periods [b — Sj,b] 
and [b,b + Sj]. We purse the analysis as follows. 

If a — b > Sj then j is entirely covered by /, otherwise let us consider the case a — b<Sj. Further, let us 
consider two time intervals [a — Si,a] and [a,a + Si\ when / works, as well as two intervals [b — Sj,b] and 
[b,b + Sj] when j works. Let us consider the differences b + Sj — a = tj — {a — b) and (a + Si) — {b + Sj) = 
{si — Sj) + {a — b). If {a — b) < Sj/2 then the first interval is > Sj/2, otherwise the other interval is > Sj/2, 
what completes the proof of the claim. ■ 



D Synchronization 

Every processor / has its own identification IDi, which is a random number. Let the number of random bits, 
representing an /D,, be much larger then logm. Then it follows that all /D,'s are different, with probability 
arbitrarily close to one. Further, we will use the terms node and processor interchangeably. 

Every node knows m, so that it can compute D < O(logm). Also, every node / keeps the following variables: 
Max{i), a set of neighbors N eighbors{i) , and RootTime{i). Besides RootTime{i), a node keeps its own 
local-clock time, OwnTime{i). Here we explain the variables. 

At the beginning, the initialization for any node / is the following. Every node assumes that it has the max- 
imum ID, that is Max{i) = /D,, at the beginning. The set of neighbors Neighbors{i) is the set of neighbors 
/ in the CommGraph, G, and the set of neighbors Neighbors{i) is known to /. The RootTime{i) is set to 
the node's current time, RootTime{i) = OwnTime{i). Every node / performs the following Synchronization 
Algorithm, defined below. 

Synchronization Algorithm 



1. Do ONCE Subroutine T. 



Subroutine T: Transmitting only 

Send its Max{i) and RootTime{i) to the set of its neighbors N eighbors{i) . 
End of Subroutine T 

1. Repeat D+ 1 times Subroutine LT. 

Subroutine LT: Listening and Transmitting 

If the node / hears, from a node j, an Max{j) higher then Max{i) then: 

LT. 1 . Set Max{i) := Max{i) . 

Explanation: Propagate Max{ID). 

LT.2. Set the new RootTime{i) := RootTime{j) + Af^. 

Explanation: i must update the time, RootTime{i), of the node with 'the maximal ID,' Max{i). Fur- 
thermore, Afr is a transmission time of the message RootTime(j), sent from j to i. Furthermore, we 
assume that Af^ is the fixed transmission time for any i / j, and the message RootTime(j) is transmit- 
ted during that period of time. 

LT.3. Propagate the new Max{i) and RootTime{i) to Neighbors{i) \ {;'}• 

Explanation: Let all other nodes, but j, know about the recent updates RootTime{i) and Max{i). 

Else, if the node / hears from a node j, an ID < Max{i) then do nothing. 
End of Subroutine LT 

3. Do ONCE Subroutine C. 

Subroutine C: Set the Clock 

Set own clock to the time of the node with the maximal ID, i.e., OwnTime{i) =RootTime{i). 
End of Subroutine C 

End of Synchronization Algorithm 

The communication over the graph CommGraph is possible for every node, since CommGraph is built as a 
concatenation in time of £) + 1 identical copies of G. 

Let us now prove the correctness of the algorithm. With high probability all /D;'s are different. There is 
the unique node, let us call it root = max,g[^] /£),•. We have to show that all nodes in the network, after the 
synchronization algorithm, have the same time, synchronized to the time of the node root. Let us consider 
any node / in the network. Since the graph distance between the root and / is less than equal to the diameter 
D, it follows that the entire synchronization procedure will be done in D + 1 steps, and all nodes will know 
the time of the root. After Subroutine C (Set the Clock), all nodes will set their own clocks, OwnTime{i) = 
RootTime{i), all being equal to OwnTime{root) with high probabiUty. This proves the correctness of the 
Synchronization Algorithm. 



